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SEQUENCE LISTING 




1) GENERAL INFORMATION: 

(i)\APPLICANT: Jaye, Michael C. 

Doan, Kim-Anh T. 
Krawiec, John A. 
Lynch, Kevin J. 
Amin, Dilip V. 
South, Victoria J. 
Marchadier, Dawn 
L Maugeais, Cyrille 
iader, Daniel J. 

(ii) TITLE OF INVeWiON: COMPOSITIONS AND METHODS FOR EFFECTING THE 
LEVELS OF HPGH DENSITY LIPOPROTEIN (HDL) CHOLESTEROL AND 
APOLIPOPROTEIN AI, VERY LOW DENSITY LIPOPROTEIN (VLDL) 
CHOLESTEROL ANDVLOW DENSITY LIPOPROTEIN (LDL) CHOLESTEROL 

(iii) NUMBER OF SEQUENCES: 31 

(iv) CORRESPONDENCE ADDRESS 

(A) ADDRESSEE: Synnes^vedt & Lechner LLP 

(B) STREET: Suite 2600 /^ramark Tower, 1101 Market Street 

(C) CITY: Philadelphia 

( D ) STATE : PA 

(E) COUNTRY: USA 

(F) ZIP : 19107 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-Windows 9< 

(D) SOFTWARE: Corel WordPerfect 8 . 0 \converted to ASCII 
(vi) CURRENT APPLICATION DATA: 
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(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 
(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Kelly, Ph.D., Patrick J. 

(B) REGISTRATION NUMBER: 34,638 

(C) REFERENCE /DOCKET NUMBER: 22,944 USA 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (215)923-4466 

(B) TELEFAX: (215)923-2189 
(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 22,. 180 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GAATTCGGCT TGATCAATCG C TTC AAA AAG GGG ATC TGT CTG AGC TGC CGC 51 

Phe Lys Lys Gly lie Cys Leu Ser Cys Arg 
15 10 
AAG AAC CGT TGT AAT AGC ATT GGC TAC AAT GCC AAG AAA ATG AGG AAC 99 
Lys Asn Arg Cys Asn Ser lie Gly Tyr Asn Ala Lys Lys Met Arg Asn 

15 c 20 25 

AAG AGG AAC AGC AAA ATG TAC CTA AAA ACC CGG GCA GGC ATG CCT TTC 147 
Lys Arg Asn Ser Lys Met Tyr Leu Lys Thr Arg Ala Gly Met Pro Phe 
30 35 40 
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AG A GGT AAC CTT CAG TCC CTG GAG TGT CCC TGA GGAAGGCCCT TAATACCTCC 200 
Arg Gly Asn Leu Gin Ser Leu Glu Cys Pro * 
45 50 

TTCTTAATAC CATGCTGCAG AGCAGGGCAC ATCCTAGCCC AGGAGAAGTG GCCAGCACAA 260 
TCCAATCAAA TCGTTGCAAA TCAGATTACA CTGTGCATGT CCTAGGAAAG GGAATCTTTA 320 
CAAAATAAAC AGTGTGGACC CCTCAAAAAA AAAAAAAAGC CGAATTC 367 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
Phe Lys Lys Gly lie Cys Leu Ser Cys Arg Lys Asn Arg Cys Asn Ser 

15 10 15 

lie Gly Tyr Asn Ala Lys Lys Met Arg Asn Lys Arg Asn Ser Lys Met 

20 25 30 

Tyr Leu Lys Thr Arg Ala Gly Met Pro Phe Arg Gly Asn Leu Gin Ser 

35 40 45 

Leu Glu Cys Pro * 
50 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1382 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
( ix ) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 312. .1370 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAATTCGGCT TCTACTACTA CTAGGCCACG CGTCGCCTAG TACGGGGGGG GGGGGGGGGG 60 
TCAGCGAGTC CTTGCCTCCC GGCGGCTCAG GACGAGGGCA GATCTCGTTC TGGGGCAAGC 120 
CGTTGACACT CGCTCCCTGC CACCGCCCGG GCTCCGTGCC GCCAAGTTTT CATTTTCCAC 180 
CTTCTCTGCC TCCAGTCCCC CAGCCCCTGG CCGAGAGAAG GGTCTTACCG GCCGGGATTG 240 
CTGGAAACAC CAAGAGGTGG TTTTTGTTTT TTAAAACTTC TGTTTCTTGG GAGGGGGTGT 300 
GGCGGGGCAG G ATG AGC AAC TCC GTT CCT CTG CTC TGT TTC TGG AGC CTC 350 
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu 
55 60 65 

TGC TAT TGC TTT GCT GCG GGG AGC CCC GTA CCT TTT GGT CCA GAG GGA 398 
Cys Tyr Cys Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly 

70 75 80 

CGG CTG GAA GAT AAG CTC CAC AAA CCC AAA GCT ACA CAG ACT GAG GTC 446 
Arg Leu Glu Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val 

85 90 95 

AAA CCA TCT GTG AGG TTT AAC CTC CGC ACC TCC AAG GAC CCA GAG CAT 494 
Lys Pro Ser Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His 

100 105 110 

GAA GGA TGC TAC CTC TCC GTC GGC CAC AGC CAG CCC TTA GAA GAC TGC 542 
Glu Gly Cys Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys 
115 120 125 130 

AGT TTC AAC ATG ACA GCT AAA ACC TTT TTC ATC ATT CAC GGA TGG ACG 590 
Ser Phe Asn Met Thr Ala Lys Thr Phe Phe lie lie His Gly Trp Thr 
135 140 145 



ATG AGC GGT ATC 
Met Ser Gly lie 
150 

CAC ACA AGA GAG 
His Thr Arg Glu 
165 

CTG GCC CAC CAG 
Leu Ala His Gin 
180 

GGA CAC AGC ATT 
Gly His Ser lie 
195 

TTT TCT CTC GGG 
Phe Ser Leu Gly 

GTG GCC GGG TAT 
Val Ala Gly Tyr 
230 

ACA GGT TTG GAT 
Thr Gly Leu Asp 
245 

AAG AGG CTC TCT 
Lys Arg Leu Ser 
260 

TAC ACG CGT TCC 
Tyr Thr Arg Ser 
275 

CAC ATT GAC ATC 
His lie Asp lie 



TTT GAA AAC TGG 
Phe Glu Asn Trp 

AAA GAC GCC AAT 
Lys Asp Ala Asn 
170 

CTT TAC ACG GAT 
Leu Tyr Thr Asp 
185 

GCC AGG ATG CTC 
Ala Arg Met Leu 
200 

AAT GTC CAC TTG 
Asn Val His Leu 
215 

GCA GGC AAC TTC 
Ala Gly Asn Phe 

CCT GCC GGG CCC 
Pro Ala Gly Pro 
250 

CCG GAC GAT GCA 
Pro Asp Asp Ala 
265 

TTC GGC TTG AGC 
Phe Gly Leu Ser 
280 

TAC CCC AAT GGG 
Tyr Pro Asn Gly 
295 
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CTG CAC AAA CTC 
Leu His Lys Leu 
155 

GTA GTT GTG GTT 
Val Val Val Val 

GCG GTC AAT AAT 
Ala Val Asn Asn 
190 

GAC TGG CTG CAG 
Asp Trp Leu Gin 
205 

ATC GGC TAC AGC 
lie Gly Tyr Ser 
220 

GTG AAA GGA ACG 
Val Lys Gly Thr 
235 

ATG TTT GAA GGG 
Met Phe Glu Gly 

GAT TTT GTG GAT 
Asp Phe Val Asp 
270 

ATT GGT ATT CAG 
lie Gly lie Gin 
285 

GGT GAC TTC CAG 
Gly Asp Phe Gin 
300 



GTG TCA GCC CTG 
Val Ser Ala Leu 
160 

GAC TGG CTC CCC 
Asp Trp Leu Pro 
175 

ACC AGG GTG GTG 
Thr Arg Val Val 

GAG AAG GAC GAT 
Glu Lys Asp Asp 
210 

CTC GGA GCG CAC 
Leu Gly Ala His 
225 

GTG GGC CGA ATC 
Val Gly Arg lie 
240 

GCC GAC ATC CAC 
Ala Asp lie His 
255 

GTC CTC CAC ACC 
Val Leu His Thr 

ATG CCT GTG GGC 
Met Pro Val Gly 
290 

CCA GGC TGT GGA 
Pro Gly Cys Gly 
305 
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CTC AAC GAT GTC TTG GGA TCA ATT GCA TAT GGA AC A ATC ACA GAG GTG 1118 
Leu Asn Asp Val Leu Gly Ser lie Ala Tyr Gly Thr lie Thr Glu Val 

310 315 320 

GTA AAA TGT GAG CAT GAG CGA GCC GTC CAC CTC TTT GTT GAC TCT CTG 1166 
Val Lys Cys Glu His Glu Arg Ala Val His Leu Phe Val Asp Ser Leu 

325 330 335 

GTG AAT CAG GAC AAG CCG AGT TTT GCC TTC CAG TGC ACT GAC TCC AAT 1214 
Val Asn Gin Asp Lys Pro Ser Phe Ala Phe Gin Cys Thr Asp Ser Asn 

340 345 350 

CGC TTC AAA AAG GGG ATC TGT CTG AGC TGC CGC AAG AAC CGT TGT AAT 1262 
Arg Phe Lys Lys Gly lie Cys Leu Ser Cys Arg Lys Asn Arg Cys Asn 
355 360 365 370 

AGC ATT GGC TAC AAT GCC AAG AAA ATG AGG AAC AAG AGG AAC AGC AAA 1310 
Ser lie Gly Tyr Asn Ala Lys Lys Met Arg Asn Lys Arg Asn Ser Lys 

375 380 385 

ATG TAC CTA AAA ACC CGG GCA GGC ATG CCT TTC AGA GGT AAC CTT CAG 1358 
Met Tyr Leu Lys Thr Arg Ala Gly Met Pro Phe Arg Gly Asn Leu Gin 

390 395 400 

TCC CTG GAG TGT CAAGCCGAAT TC 1382 
Ser Leu Glu Cys 
405 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu Cys Tyr Cys 

15 10 15 

Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly Arg Leu Glu 

20 25 30 

Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val Lys Pro Ser 

35 40 45 

Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His Glu Gly Cys 

50 55 60 

Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys Ser Phe Asn 
65 70 75 80 

Met Thr Ala Lys Thr Phe Phe lie lie His Gly Trp Thr Met Ser Gly 

85 90 95 

lie Phe Glu Asn Trp Leu His Lys Leu Val Ser Ala Leu His Thr Arg 

100 105 110 

Glu Lys Asp Ala Asn Val Val Val Val Asp Trp Leu Pro Leu Ala His 

115 120 125 

Gin Leu Tyr Thr Asp Ala Val Asn Asn Thr Arg Val Val Gly His Ser 

130 135 140 

lie Ala Arg Met Leu Asp Trp Leu Gin Glu Lys Asp Asp Phe Ser Leu 
145 150 155 160 

Gly Asn Val His Leu lie Gly Tyr Ser Leu Gly Ala His Val Ala Gly 

165 170 175 

Tyr Ala Gly Asn Phe Val Lys Gly Thr Val Gly Arg lie Thr Gly Leu 

180 185 190 

Asp Pro Ala Gly Pro Met Phe Glu Gly Ala Asp lie His Lys Arg Leu 

195 200 205 

Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu His Thr Tyr Thr Arg 
210 215 220 
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Ser Phe Gly Leu Ser lie Gly lie Gin Met Pro Val Gly His lie Asp 
225 230 235 240 

lie Tyr Pro Asn Gly Gly Asp Phe Gin Pro Gly Cys Gly Leu Asn Asp 

245 250 255 

Val Leu Gly Ser lie Ala Tyr Gly Thr lie Thr Glu Val Val Lys Cys 

260 265 270 

Glu His Glu Arg Ala Val His Leu Phe Val Asp Ser Leu Val Asn Gin 

275 280 285 

Asp Lys Pro Ser Phe Ala Phe Gin Cys Thr Asp Ser Asn Arg Phe Lys 

290 295 300 

Lys Gly lie Cys Leu Ser Cys Arg Lys Asn Arg Cys Asn Ser lie Gly 
305 310 315 320 

Tyr Asn Ala Lys Lys Met Arg Asn Lys Arg Asn Ser Lys Met Tyr Leu 

325 330 335 

Lys Thr Arg Ala Gly Met Pro Phe Arg Gly Asn Leu Gin Ser Leu Glu 
340 345 350 

Cys 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1065 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1065 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATG AGC AAC TCC GTT CCT CTG CTC TGT TTC TGG AGC CTC TGC TAT TGC 48 
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu Cys Tyr Cys 

355 360 365 

TTT GCT GCG GGG AGC CCC GTA CCT TTT GGT CCA GAG GGA CGG CTG GAA 96 
Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly Arg Leu Glu 
370 375 380 385 

GAT AAG CTC CAC AAA CCC AAA GCT ACA CAG ACT GAG GTC AAA CCA TCT 144 
Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val Lys Pro Ser 

390 395 400 

GTG AGG TTT AAC CTC CGC ACC TCC AAG GAC CCA GAG CAT GAA GGA TGC 192 
Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His Glu Gly Cys 

405 410 415 

TAC CTC TCC GTC GGC CAC AGC CAG CCC TTA GAA GAC TGC AGT TTC AAC 240 
Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys Ser Phe Asn 

420 425 430 

ATG ACA GCT AAA ACC TTT TTC ATC ATT CAC GGA TGG ACG ATG AGC GGT 288 
Met Thr Ala Lys Thr Phe Phe lie lie His Gly Trp Thr Met Ser Gly 

435 440 445 

ATC TTT GAA AAC TGG CTG CAC AAA CTC GTG TCA GCC CTG CAC ACA AGA 336 
lie Phe Glu Asn Trp Leu His Lys Leu Val Ser Ala Leu His Thr Arg 
450 455 460 465 

GAG AAA GAC GCC AAT GTA GTT GTG GTT GAC TGG CTC CCC CTG GCC CAC 384 
Glu Lys Asp Ala Asn Val Val Val Val Asp Trp Leu Pro Leu Ala His 

470 475 480 

CAG CTT TAC ACG GAT GCG GTC AAT AAT ACC AGG GTG GTG GGA CAC AGC 432 
Gin Leu Tyr Thr Asp Ala Val Asn Asn Thr Arg Val Val Gly His Ser 
485 490 495 
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ATT GCC AGG ATG CTC GAC TGG CTG GAG GAG AAG GAC GAT TTT TCT CTC 480 
lie Ala Arg Met Leu Asp Trp Leu Gin Glu Lys Asp Asp Phe Ser Leu 

500 505 510 

GGG AAT GTC CAC TTG ATC GGC TAC AGC CTC GGA GCG CAC GTG GCC GGG 528 
Gly Asn Val His Leu lie Gly Tyr Ser Leu Gly Ala His Val Ala Gly 

515 520 525 

TAT GCA GGC AAC TTC GTG AAA GGA ACG GTG GGC CGA ATC ACA GGT TTG 576 
Tyr Ala Gly Asn Phe Val Lys Gly Thr Val Gly Arg lie Thr Gly Leu 
530 535 540 545 

GAT CCT GCC GGG CCC ATG TTT GAA GGG GCC GAC ATC CAC AAG AGG CTC 624 
Asp Pro Ala Gly Pro Met Phe Glu Gly Ala Asp lie His Lys Arg Leu 

550 555 560 

TCT CCG GAC GAT GCA GAT TTT GTG GAT GTC CTC CAC ACC TAC ACG CGT 672 
Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu His Thr Tyr Thr Arg 

565 570 575 

TCC TTC GGC TTG AGC ATT GGT ATT CAG ATG CCT GTG GGC CAC ATT GAC 720 
Ser Phe Gly Leu Ser lie Gly lie Gin Met Pro Val Gly His lie Asp 

580 585 590 

ATC TAC CCC AAT GGG GGT GAC TTC CAG CCA GGC TGT GGA CTC AAC GAT 7 68 

lie Tyr Pro Asn Gly Gly Asp Phe Gin Pro Gly Cys Gly Leu Asn Asp 

595 600 605 

GTC TTG GGA TCA ATT GCA TAT GGA ACA ATC ACA GAG GTG GTA AAA TGT 816 
Val Leu Gly Ser lie Ala Tyr Gly Thr lie Thr Glu Val Val Lys Cys 
610 '615 620 625 

GAG CAT GAG CGA GCC GTC CAC CTC TTT GTT GAC TCT CTG GTG AAT CAG 864 
Glu His Glu Arg Ala Val His Leu Phe Val Asp Ser Leu Val Asn Gin 

630 635 640 

GAC AAG CCG AGT TTT GCC TTC CAG TGC ACT GAC TCC AAT CGC TTC AAA 912 
Asp Lys Pro Ser Phe Ala Phe Gin Cys Thr Asp Ser Asn Arg Phe Lys 
645 650 655 ^ 



• 
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AAG GGG ATC TGT CTG AGC 
Lys Gly lie Cys Leu Ser 
660 

TAC AAT GCC AAG AAA ATG 
Tyr Asn Ala Lys Lys Met 
675 

AAA ACC CGG GCA GGC ATG 
Lys Thr Arg Ala Gly Met 
690 695 
TGT CCC TGA 
Cys Pro * 



TGC CGC AAG AAC CGT TGT 
Cys Arg Lys Asn Arg Cys 
665 

AGG AAC AAG AGG AAC AGC 
Arg Asn Lys Arg Asn Ser 
680 685 
CCT TTC AGA GGT AAC CTT 
Pro Phe Arg Gly Asn Leu 
700 



AAT AGC ATT GGC 960 

Asn Ser lie Gly 

670 

AAA ATG TAC CTA 1008 
Lys Met Tyr Leu 

CAG TCC CTG GAG 1056 
Gin Ser Leu Glu 
705 

1065 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu Cys Tyr Cys 

15 10 15 

Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly Arg Leu Glu 

20 25 30 

Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val Lys Pro Ser 

35 40 45 

Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His Glu Gly Cys 

50 55 60 

Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys Ser Phe Asn 
65 70 75 80 



Met Thr Ala Lys 

lie Phe Glu Asn 
100 

Glu Lys Asp Ala 
115 

Gin Leu Tyr Thr 
130 

lie Ala Arg Met 
145 

Gly Asn Val His 

Tyr Ala Gly Asn 
180 

Asp Pro Ala Gly 
195 

Ser Pro Asp Asp 
210 

Ser Phe Gly Leu 
225 

lie Tyr Pro Asn 

Val Leu Gly Ser 
260 

Glu His Glu Arg 
275 

Asp Lys Pro Ser 
290 

Lys Gly lie Cys 
305 




Thr Phe Phe lie 
85 

Trp Leu His Lys 

Asn Val Val Val 
120 

Asp Ala Val Asn 
135 

Leu Asp Trp Leu 
150 

Leu lie Gly Tyr 
165 

Phe Val Lys Gly 

Pro Met Phe Glu 
200 

Ala Asp Phe Val 
215 

Ser lie Gly lie 
230 

Gly Gly Asp Phe 
245 

lie Ala Tyr Gly 

Ala Val His Leu 
280 

Phe Ala Phe Gin 
295 

Leu Ser Cys Arg 
310 
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lie His Gly Trp 
90 

Leu Val Ser Ala 
105 

Val Asp Trp Leu 

Asn Thr Arg Val 
140 

Gin Glu Lys Asp 
155 

Ser Leu Gly Ala 
170 

Thr Val Gly Arg 
185 

Gly Ala Asp lie 

Asp Val Leu His 
220 

Gin Met Pro Val 
235 

Gin Pro Gly Cys 
250 

Thr lie Thr Glu 
265 

Phe Val Asp Ser 

Cys Thr Asp Ser 
300 

Lys Asn Arg Cys 
315 




Thr Met Ser Gly 
95 

Leu His Thr Arg 
110 

Pro Leu Ala His 
125 

Val Gly His Ser 

Asp Phe Ser Leu 
160 

His Val Ala Gly 
175 

lie Thr Gly Leu 
190 

His Lys Arg Leu 
205 

Thr Tyr Thr Arg 

Gly His lie Asp 
240 

Gly Leu Asn Asp 
255 

Val Val Lys Cys 
270 

Leu Val Asn Gin 
285 

Asn Arg Phe Lys 

Asn Ser lie Gly 
320 
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Tyr Asn Ala Lys Lys Met Arg Asn Lys Arg Asn Ser Lys Met Tyr Leu 



325 



330 



335 



Lys Thr Arg Ala Gly Met Pro Phe Arg Gly Asn Leu Gin Ser Leu Glu 



340 



345 



350 



Cys Pro * 
355 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

{ ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 252.. 1754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GAATTCGCGG CCGCGTCGAC GGCGGCTCAG GACGAGGGCA GATCTCGTTC TGGGGCAAGC 60 
CGTTGACACT CGCTCCCTGC CACCGCCCGG GCTCCGTGCC GCCAAGTTTT CATTTTCCAC 120 
CTTCTCTGCC TCCAGTCCCC CAGCCCCTGG CCGAGAGAAG GGTCTTACCG GCCGGGATTG 180 
CTGGAAACAC CAAGAGGTGG TTTTTGTTTT TTAAAACTTC TGTTTCTTGG GAGGGGGTGT 240 
GGCGGGGCAG G ATG AGC AAC TCC GTT CCT CTG CTC TGT TTC TGG AGC CTC 290 



Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu 



360 



365 



TGC TAT TGC TTT GCT GCG GGG AGC CCC GTA CCT TTT GGT CCA GAG GGA 



338 



Cys Tyr Cys Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly 



370 



375 



380 
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CGG CTG GAA GAT AAG CTC CAC AAA CCC AAA GCT ACA CAG ACT GAG GTC 386 

Arg Leu Glu Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val ; 

385 390 395 400 

AAA CCA TCT GTG AGG TTT AAC CTC CGC ACC TCC AAG GAC CCA GAG CAT 434 

Lys Pro Ser Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His 

405 410 415 

GAA GGA TGC TAC CTC TCC GTC GGC CAC AGC CAG CCC TTA GAA GAC TGC 482 
Glu Gly Cys Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys 

420 425 430 

AGT TTC AAC ATG ACA GCT AAA ACC TTT TTC ATC ATT CAC GGA TGG ACG 530 
Ser Phe Ash Met Thr Ala Lys Thr Phe Phe lie lie His Gly Trp Thr 

435 440 445 

ATG AGC GGT ATC TTT GAA AAC TGG CTG CAC AAA CTC GTG TCA GCC CTG , 578 
Met Ser Gly lie Phe Glu Asn Trp Leu His Lys Leu Val Ser Ala Leu 

450 455 460 

CAC ACA AGA GAG AAA GAC GCC AAT GTA GTT GTG GTT GAC TGG CTC CCC 626 
His Thr Arg Glu Lys Asp Ala Asn Val Val Val Val Asp Trp Leu Pro 
465 470 475 480 

CTG GCC CAC CAG CTT TAC ACG GAT GCG GTC AAT AAT ACC AGG GTG GTG 674 
Leu Ala His Gin Leu Tyr Thr Asp Ala Val Asn Asn Thr Arg Val Val 

485 490 495 

GGA CAC AGC ATT GCC AGG ATG CTC GAC TGG CTG CAG GAG AAG GAC GAT 722 
Gly His Ser lie Ala Arg Met Leu Asp Trp Leu Gin Glu Lys Asp Asp 

500 505 510 

TTT TCT CTC GGG* AAT GTC CAC TTG ATC GGC TAC AGC CTC GGA GCG CAC 770 
Phe Ser Leu Gly Asn Val His Leu lie Gly Tyr Ser Leu Gly Ala His 

515 520 525 

GTG GCC GGG TAT GCA GGC AAC TTC GTG AAA GGA ACG GTG GGC CGA ATC 818 
Val Ala Gly Tyr Ala Gly Asn Phe Val Lys Gly Thr Val Gly Arg lie 
530 535 540 



ACA GGT TTG GAT 
Thr Gly Leu Asp 
545 

AAG AGG CTC TCT 
Lys Arg Leu Ser 

TAC ACG CGT TCC 
Tyr Thr Arg Ser 
580 

CAC ATT GAC ATC 
His lie Asp lie 
595 

CTC AAC GAT GTC 
Leu Asn Asp Val 
610 

GTA AAA TGT GAG 
Val Lys Cys Glu 
625 

GTG AAT CAG GAC 
Val Asn Gin Asp 

CGC TTC AAA AAG 
Arg Phe Lys Lys 
660 

AGC ATT GGC TAC 
Ser lie Gly Tyr 
675 

ATG TAC CTA AAA 
Met Tyr Leu Lys 
690 



CCT GCC GGG CCC 
Pro Ala Gly Pro 
550 

CCG GAC GAT GCA 
Pro Asp Asp Ala 
565 

TTC GGC TTG AGC 
Phe Gly Leu Ser 

TAC CCC AAT GGG 
Tyr Pro Asn Gly 
600 

TTG GGA TCA ATT 
Leu Gly Ser lie 
615 

CAT GAG CGA GCC 
His Glu Arg Ala 
630 

AAG CCG AGT TTT 
Lys Pro Ser Phe 
645 

GGG ATC TGT CTG 
Gly lie Cys Leu 

AAT GCC AAG AAA 
Asn Ala Lys Lys 
680 

ACC CGG GCA GGC 
Thr Arg Ala Gly 
695 
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ATG TTT GAA GGG 
Met Phe Glu Gly 
555 

GAT TTT GTG GAT 
Asp Phe Val Asp 
570 

ATT GGT ATT CAG 
lie Gly lie Gin 
585 

GGT GAC TTC CAG 
Gly Asp Phe Gin 

GCA TAT GGA ACA 
Ala Tyr Gly Thr 
620 

GTC CAC CTC TTT 
Val His Leu Phe 
635 

GCC TTC CAG TGC 
Ala Phe Gin Cys 
650 

AGC TGC CGC AAG 
Ser Cys Arg Lys 
665 

ATG AGG AAC AAG 
Met Arg Asn Lys 

ATG CCT TTC AGA 
Met Pro Phe Arg 
700 



GCC GAC ATC CAC 
Ala Asp lie His 
560 

GTC CTC CAC ACC 
Val Leu His Thr 
575 

ATG CCT GTG GGC 
Met Pro Val Gly 
590 

CCA GGC TGT GGA 
Pro Gly Cys Gly 
605 

ATC ACA GAG GTG 
lie Thr Glu Val 

GTT GAC TCT CTG 
Val Asp Ser Leu 
640 

ACT GAC TCC AAT 
Thr Asp Ser Asn 
655 

AAC CGT TGT AAT 
Asn Arg Cys Asn 
670 

AGG AAC AGC AAA 
Arg Asn Ser Lys 
685 

GTT TAC CAT TAT 
Val Tyr His Tyr 
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CAG ATG AAA ATC CAT GTC TTC AGT TAC AAG AAC ATG GGA GAA ATT GAG 1346 

Gin Met Lys lie His Val Phe Ser Tyr Lys Asn Met Gly Glu lie Glu 

705 710 715 720 

CCC ACC TTT TAC GTC ACC CTT TAT GGC ACT AAT GCA GAT TCC CAG ACT 1394 

Pro Thr Phe Tyr Val Thr Leu Tyr Gly Thr Asn Ala Asp Ser Gin Thr 

725 730 735 

CTG CCA CTG GAA ATA GTG GAG CGG ATC GAG CAG AAT GCC ACC AAC ACC 1442 
Leu Pro Leu Glu lie Val Glu Arg lie Glu Gin Asn Ala Thr Asn Thr 

740 745 750 

TTC CTG GTC TAC ACC GAG GAG GAC TTG GGA GAC CTC TTG AAG ATC CAG 1490 
Phe Leu Val Tyr Thr Glu Glu Asp Leu Gly Asp Leu Leu Lys lie Gin 

755 760 765 

CTC ACC TGG GAG GGG GCC TCT CAG TCT TGG TAC AAC CTG TGG AAG GAG 1538 
Leu Thr Trp Glu Gly Ala Ser Gin Ser Trp Tyr Asn Leu Trp Lys Glu 

770 775 780 

TTT CGC AGC TAC CTG TCT CAA CCC CGC AAC CCC GGA CGG GAG CTG AAT 1586 
Phe Arg Ser Tyr Leu Ser Gin Pro Arg Asn Pro Gly Arg Glu Leu Asn 
785 790 795 800 

ATC AGG CGC ATC CGG GTG AAG TCT GGG GAA ACC CAG CGG AAA CTG ACA 1634 
lie Arg Arg lie Arg Val Lys Ser Gly Glu Thr Gin Arg Lys Leu Thr , 

805 810 815 

TTT TGT ACA GAA GAC CCT GAG AAC ACC AGC ATA TCC CCA GGC CGG GAG 1682 
Phe Cys Thr Glu Asp Pro Glu Asn Thr Ser lie Ser Pro Gly Arg Glu 

820 825 830 

CTC TGG TTT CGC AAG TGT CGG GAT GGC TGG AGG ATG AAA AAC GAA ACC 1730 
Leu Trp Phe Arg Lys Cys Arg Asp Gly Trp Arg Met Lys Asn Glu Thr 

835 840 845 

AGT CCC ACT GTG GAG CTT CCC TGA GGGTGCCCGG GCAAGTCTTG CCAGCAAGGC 1784 
Ser Pro Thr Val Glu Leu Pro * 
850 855 
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AGCAAGACTT CCTGCTATCC AAGCCCATGG AGGAAAGTTA CTGCTGAGGA CCCACCCAAT 1844 

GGAAGGATTC TTCTCAGCCT TGACCCTGGA GCACTGGGAA CAACTGGTCT CCTGTGATGG 1904 

CTGGGACTCC TCGCGGGAGG GGACTGCGCT GCTATAGCTC TTGCTGCCTC TCTTGAATAG 1964 

CTCTAACTCC AAACCTCTGT CCACACCTCC AGAGCACCAA GTCCAGATTT GTGTGTAAGC 2024 

AGCTGGGTGC CTGGGGCCTC TCGTGCACAC TGGATTGGTT TCTCAGTTGC TGGGCGAGCC 2084 

TGTACTCTGC CTGACGAGGA ACGCTGGCTC CGAAGAGGCC CTGTGTAGAA GGCTGTCAGC 2144 

TGCTCAGCCT GCTTTGAGCC TCAGTGAGAA GTCCTTCCGA CAGGAGCTGA CTCATGTCAG 2204 

GATGGCAGGC CTGGTATCTT GCTCGGGCCC TGGCTGTTGG GGTTCTCATG GGTTGCACTG 2264 

ACCATACTGC TTACGTCTTA GCCATTCCGT CCTGCTCCCC AGCTCACTCT CTGAAGCACA 2324 

CATCATTGGC TTTCCTATTT TTCTGTTCAT TTTTTAATTG AGCAAATGTC TATTGAACAC 2384 

TTAAAATTAA TTAGAATGTG GTAATGGACA TATTACTGAG CCTCTCCATT TGGAACCCAG 2444 

TGGAGTTGGG ATTTCTAGAC CCTCTTTCTG TTTGGATGGT GTATGTGTAT ATGCATGGGG 2504 

AAAGGCACCT GGGGCCTGGG GGAGGCTATA GGATATAAGC AGTCGACGCG GCCGCGAATT 2564 

C 2565 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu Cys Tyr Cys 

15 10 15 

Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly Arg Leu Glu 

20 25 30 

Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val Lys Pro Ser 

35 40 45 

Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His Glu Gly Cys 
50 55 60 



• # 
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Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys Ser Phe Asn 
65 70 75 80 

Met Thr Ala Lys Thr Phe Phe lie lie His Gly Trp Thr Met Ser Gly 

85 90 95 

lie Phe Glu Asn Trp Leu His Lys Leu Val Ser Ala Leu His Thr Arg 

100 105 110 

Glu Lys Asp Ala Asn Val Val Val Val Asp Trp Leu Pro Leu Ala His 

115 120 125 

Gin Leu Tyr Thr Asp Ala Val Asn Asn Thr Arg Val Val Gly His Ser 

130 135 140 

lie Ala Arg Met . Leu Asp Trp Leu Gin Glu Lys Asp Asp Phe Ser Leu 
145 150 155 160 

Gly Asn Val His Leu lie Gly Tyr Ser Leu Gly Ala His Val Ala Gly 

165 170 175 

Tyr Ala Gly Asn Phe Val Lys Gly Thr Val Gly Arg lie Thr Gly Leu 

180 185 190 

Asp Pro Ala Gly Pro Met Phe Glu Gly Ala Asp lie His Lys Arg Leu 

195 200 205 

Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu His Thr Tyr Thr Arg 

210 215 220 

Ser Phe Gly Leu Ser lie Gly lie Gin Met Pro Val Gly His lie Asp 
225 230 235 240 

lie Tyr Pro Asn Gly Gly Asp Phe Gin Pro Gly Cys Gly Leu Asn Asp 

245 250 255 

Val Leu Gly Ser lie Ala Tyr Gly Thr lie Thr Glu Val Val Lys Cys 

260 265 270 

Glu His Glu Arg Ala Val His Leu Phe Val Asp Ser Leu Val Asn Gin 

275 280 285 

Asp Lys Pro Ser Phe Ala Phe Gin Cys Thr Asp Ser Asn Arg Phe Lys 
290 295 300 
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Lys Gly lie Cys Leu Ser Cys Arg Lys Asn Arg Cys Asn Ser lie Gly 
305 310 315 320 

Tyr Asn Ala Lys Lys Met Arg Asn Lys Arg Asn Ser Lys Met Tyr Leu 

325 330 ~~ 335 

Lys Thr Arg Ala Gly Met Pro Phe Arg Val Tyr His Tyr Gin Met Lys 

340 345 350 

lie His Val Phe Ser Tyr Lys Asn Met Gly Glu lie Glu Pro Thr Phe 

355 360 365 

Tyr Val Thr Leu Tyr Gly Thr Asn Ala Asp Ser Gin Thr Leu Pro Leu 

370 375 380 

Glu lie Val Glu Arg lie Glu Gin Asn Ala Thr Asn Thr Phe Leu Val 
385 390 395 400 

Tyr Thr Glu Glu Asp Leu Gly Asp Leu Leu Lys lie Gin Leu Thr Trp 

405 410 415 

Glu Gly Ala Ser Gin Ser Trp Tyr Asn Leu Trp Lys Glu Phe Arg Ser 

420 425 430 

Tyr Leu Ser Gin Pro Arg Asn Pro Gly Arg Glu Leu Asn lie Arg Arg 

435 440 445 

lie Arg Val Lys Ser Gly Glu Thr Gin Arg Lys Leu Thr Phe Cys Thr 

450 455 460 

Glu Asp Pro Glu Asn Thr Ser lie Ser Pro Gly Arg Glu Leu Trp Phe 
465 470 475 480 

Arg Lys Cys Arg Asp Gly Trp Arg Met Lys Asn Glu Thr Ser Pro Thr 

485 490 495 

Val Glu Leu Pro * 
500 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 
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( B ) TYPE : nuc leic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ATG AGC AAC TCC GTT CCT CTG CTC TGT TTC TGG AGC CTC TGC TAT TGC 48 
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu Cys Tyr Cys 

505 510 515 

TTT GCT GCG GGG AGC CCC GTA CCT TTT GGT CCA GAG GGA CGG CTG GAA 96 
Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly Arg Leu Glu 

520 525 530 

GAT AAG CTC CAC AAA CCC AAA GCT ACA CAG ACT GAG GTC AAA CCA TCT 144 
Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val Lys Pro Ser 

535 540 545 

GTG AGG TTT AAC CTC CGC ACC TCC AAG GAC CCA GAG CAT GAA GGA TGC 192 
Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His Glu Gly Cys 
550 555 560 565 

TAC CTC TCC GTC GGC CAC AGC CAG CCC TTA GAA GAC TGC AGT TTC AAC 240 
Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys Ser Phe Asn 

570 575 580 

ATG ACA GCT AAA ACC TTT TTC ATC ATT CAC GGA TGG ACG ATG AGC GGT 288 
Met Thr Ala Lys Thr Phe Phe lie lie His Gly Trp Thr Met Ser Gly 

585 590 595 

ATC TTT GAA AAC TGG CTG CAC AAA CTC GTG TCA GCC CTG CAC ACA AGA 336 
lie Phe Glu Asn Trp Leu His Lys Leu Val Ser Ala Leu His Thr Arg 
600 605 610 
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GAG AAA GAC GCC AAT GTA GTT GTG GTT GAG TGG CTC CCC CTG GCC CAC 384 
Glu Lys Asp Ala Asn Val Val Val Val Asp Trp Leu Pro Leu Ala His 

615 620 625 

CAG CTT TAC ACG GAT GCG GTC AAT AAT ACC AGG GTG GTG GGA CAC AGC 432 
Gin Leu Tyr Thr Asp Ala Val Asn Asn Thr Arg Val Val Gly His Ser 
630 635 640 645 

ATT GCC AGG ATG CTC GAC TGG CTG CAG GAG AAG GAC GAT TTT TCT CTC 480 
lie Ala Arg Met Leu Asp Trp Leu Gin Glu Lys Asp Asp Phe Ser Leu 

650 655 660 

GGG AAT GTC CAC TTG ATC GGC TAC AGC CTC GGA GCG CAC GTG GCC GGG 528 
Gly Asn Val His Leu lie Gly Tyr Ser Leu Gly Ala His Val Ala Gly 

665 670 675 

TAT GCA GGC AAC TTC GTG AAA GGA ACG GTG GGC CGA ATC ACA GGT TTG 576 
Tyr Ala Gly Asn Phe Val Lys Gly Thr Val Gly Arg lie Thr Gly Leu 

680 685 690 

GAT CCT GCC GGG CCC ATG TTT GAA GGG GCC GAC ATC CAC AAG AGG CTC 624 
Asp Pro Ala Gly Pro Met Phe Glu Gly Ala Asp lie His Lys Arg Leu 

695 700 705 

TCT CCG GAC GAT GCA GAT TTT GTG GAT GTC CTC CAC ACC TAC ACG CGT 672 
Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu His Thr Tyr Thr Arg 
710 715 720 725 

TCC TTC GGC TTG AGC ATT GGT ATT CAG ATG CCT GTG GGC CAC ATT GAC 720 
Ser Phe Gly Leu Ser lie Gly lie Gin Met Pro Val Gly His lie Asp 

730 735 740 

ATC TAC CCC AAT GGG GGT GAC TTC CAG CCA GGC TGT GGA CTC AAC GAT 768 
lie Tyr Pro Asn Gly Gly Asp Phe Gin Pro Gly Cys Gly Leu Asn Asp 

745 750 755 

GTC TTG GGA TCA ATT GCA TAT GGA ACA ATC ACA GAG GTG GTA AAA TGT 816 
Val Leu Gly Ser lie Ala Tyr Gly Thr lie Thr Glu Val Val Lys Cys 
760 765 770 




GAG CAT GAG CGA GCC GTC 
Glu His Glu Arg Ala Val 
775 

GAC AAG CCG AGT TTT GCC 
Asp Lys Pro Ser Phe Ala 
790 795 
AAG GGG ATC TGT CTG AGC 
Lys Gly lie Cys Leu Ser 
810 

TAC AAT GCC AAG AAA ATG 
Tyr Asn Ala Lys Lys Met 
825 

AAA ACC CGG GCA GGC ATG 
Lys Thr Arg Ala Gly Met 
840 
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CAC CTC TTT GTT GAC TCT 
His Leu Phe Val Asp Ser 
780 785 
TTC CAG TGC ACT GAC TCC 
Phe Gin Cys Thr Asp Ser 
800 

TGC CGC AAG AAC CGT TGT 
Cys Arg Lys Asn Arg Cys 
815 

AGG AAC AAG AGG AAC AGC 
Arg Asn Lys Arg Asn Ser 
830 

CCT TTC AGA 
Pro Phe Arg 
845 




CTG GTG AAT CAG 864 
Leu Val Asn Gin 

AAT CGC TTC AAA 912 
Asn Arg Phe Lys 
805 

AAT AGC ATT GGC 960 
Asn Ser lie Gly 
820 

AAA ATG TAC CTA 1008 
Lys Met Tyr Leu 
835 

1035 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

( ii ) MOLECULE TYPE : protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Met Ser Asn Ser Val Pro Leu Leu Cys Phe Trp Ser Leu Cys Tyr Cys 

15 10 15 

Phe Ala Ala Gly Ser Pro Val Pro Phe Gly Pro Glu Gly Arg Leu Glu 

20 25 30 

Asp Lys Leu His Lys Pro Lys Ala Thr Gin Thr Glu Val Lys Pro Ser 
35 40 45 
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Val Arg Phe Asn Leu Arg Thr Ser Lys Asp Pro Glu His Glu Gly Cys 

50 55 60 

Tyr Leu Ser Val Gly His Ser Gin Pro Leu Glu Asp Cys Ser Phe Asn 
65 70 75 80 

Met Thr Ala Lys Thr Phe Phe lie He His Gly Trp Thr Met Ser Gly 

85 90 95 

He Phe Glu Asn Trp Leu His Lys Leu Val Ser Ala Leu His Thr Arg 

100 105 110 

Glu Lys Asp Ala Asn Val Val Val Val Asp Trp Leu Pro Leu Ala His 

115 120 125 

Gin Leu Tyr Thr Asp Ala Val Asn Asn Thr Arg Val Val Gly His Ser 

130 135 140 

He Ala Arg Met Leu Asp Trp Leu Gin Glu Lys Asp Asp Phe Ser Leu 
145 150 155 160 

Gly Asn Val His Leu He Gly Tyr Ser Leu Gly Ala His Val Ala Gly 

165 170 175 

Tyr Ala Gly Asn Phe Val Lys Gly Thr Val Gly Arg He Thr Gly Leu 

180 185 190 

Asp Pro Ala Gly Pro Met Phe Glu Gly Ala Asp He His Lys Arg Leu 

195 200 205 

Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu His Thr Tyr Thr Arg 

210 215 220 

Ser Phe Gly Leu Ser He Gly He Gin Met Pro Val Gly His He Asp 
225 230 235 240 

He Tyr Pro Asn Gly Gly Asp Phe Gin Pro Gly Cys Gly Leu Asn Asp 

245 250 255 

Val Leu Gly Ser He Ala Tyr Gly Thr He Thr Glu Val Val Lys Cys 

260 265 270 

Glu His Glu Arg Ala Val His Leu Phe Val Asp Ser Leu Val Asn Gin 
275 280 285 



Asp Lys Pro Ser 
290 

Lys Gly lie Cys 
305 

Tyr Asn Ala Lys 

Lys Thr Arg Ala 
340 



Phe Ala Phe Gin 
295 

Leu Ser Cys Arg 
310 

Lys Met Arg Asn 
325 

Gly Met Pro Phe 
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Cys Thr Asp Ser 
300 

Lys Asn Arg Cys 
315 

Lys Arg Asn Ser 
330 

Arg 
345 



Asn Arg Phe Lys 

Asn Ser lie Gly 
320 

Lys Met Tyr Leu 
335 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTG GGA TCC ATC GCC TAT GGC ACG ATC GCG GAG GTG GTG AAG TGC GAG 
Leu Gly Ser lie Ala Tyr Gly Thr lie Ala Glu Val Val Lys Cys Glu 

350 355 360 

CAT GAG CGG GCC GTG CAT CTC TTT GTG GAC TCC CTG GTG AAC CAG GAC 
His Glu Arg Ala Val His Leu Phe Val Asp Ser Leu Val Asn Gin Asp 

365 370 375 

AAG CCG AGC TTT GCC TTC CAG TGC ACA GAC TCC AAC CGC TTC AAA AAA 
Lys Pro Ser Phe Ala Phe Gin Cys Thr Asp Ser Asn Arg Phe Lys Lys 

380 385 390 

GGG ATC TGT CTC AGC TGC CGG AAG AAC CGC TGT AAC GGC ATC GGC TAC 
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Gly lie Cys Leu Ser Cys Arg Lys Asn Arg Cys Asn Gly lie Gly Tyr 

395 400 405 

AAT GCT AAG AAG ACG AGG AAT AAG AGG AAC ACC 225 
Asn Ala Lys Lys Thr Arg Asn Lys Arg Asn Thr 
410 415 420 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Leu Gly Ser lie Ala Tyr Gly Thr lie Ala Glu Val Val Lys Cys Glu 

15 10 15 

His Glu Arg Ala Val His Leu Phe Val Asp Ser Leu Val Asn Gin Asp 

20 25 30 

Lys Pro Ser Phe Ala Phe Gin Cys Thr Asp Ser Asn Arg Phe Lys Lys 

35 40 45 

Gly lie Cys Leu Ser Cys Arg Lys Asn Arg Cys Asn Gly lie Gly Tyr 

50 55 60 

Asn Ala Lys Lys Thr Arg Asn Lys Arg Asn Thr 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Glu Ser Lys Ala Leu Leu Val Leu Thr Leu Ala Val Trp Leu Gin 

1 5 10 15 

Ser Leu Thr Ala Ser Arg Gly Gly Val Ala Ala Ala Asp Gin Arg Arg 

20 25 30 

Asp Phe lie Asp lie Glu Ser Lys Phe Ala Leu Arg Thr Pro Glu Asp 

35 40 45 

Thr Ala Glu Asp Thr Cys His Leu lie Pro Gly Val Ala Glu Ser Val 

50 55 60 

Ala Thr Cys His Phe Asn His Ser Ser Lys Thr Phe Met Val lie His 
65 70 75 80 

Gly Trp Thr Val Thr Gly Met Tyr Glu Ser Trp Val Pro Lys Leu Val 

85 90 95 

Ala Ala Leu Tyr Lys Arg Glu Pro Asp Ser Asn Val lie Val Val Asp 

100 105 110 

Trp Leu Ser Arg Ala Gin Glu His Tyr Pro Val Ser Ala Gly Tyr Thr 

115 120 125 

Lys Val Gly Gin Asp Val Ala Arg Phe lie Asn Trp Met Glu Glu Glu 

130 135 140 

Phe Asn Tyr Pro Leu Asp Asn Val His Leu Leu Gly Tyr Ser Leu Gly 
145 150 155 160 

Ala His Ala Ala Gly lie Ala Gly Ser Leu Thr Asn Lys Lys Val Asn 

165 170 175 

Arg lie Thr Gly Leu Asp Pro Ala Gly Pro Asn Phe Glu Tyr Ala Glu 

180 185 190 

Ala Pro Arg Leu Ser Pro Asp Asp Ala Asp Phe Val Asp Val Leu His 

195 200 205 

Thr Phe Thr Arg Gly Ser Pro Gly Arg Ser lie Gly lie Gin Lys Pro 
210 215 220 
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Val Gly His Val Asp lie Tyr Pro Asn Gly Gly Thr Phe Gin Pro Gly 
225 230 235 240 

Cys Asn lie Gly Glu Ala lie Arg Val lie Ala Glu Arg Gly Leu Gly 

245 250 255 

Asp Val Asp Gin Leu Lys Cys Ser His Glu Arg Ser lie His Leu Phe 

260 265 270 

lie Asp Ser Leu Leu Asn Glu Glu Asn Pro Ser Lys Ala Tyr Arg Cys 

275 280 285 

Ser Ser Lys Glu Ala Phe Glu Lys Gly Leu Cys Leu Ser Cys Arg Lys 

290 295 300 

Asn Arg Cys Asn Asn Leu Gly Tyr Glu lie Asn Lys Val Arg Ala Lys 
305 310 315 320 

Arg Ser Ser Lys Met Tyr Leu Lys Thr Arg Ser Gin Met Pro Tyr Lys 

325 330 335 

Val Phe His Tyr Gin Val Lys lie His Phe Ser Gly Thr Glu Ser Glu 

340 345 350 

Thr His Thr Asn Gin Ala Phe Glu lie Ser Leu Tyr Gly Thr Val Ala 

355 360 365 

Glu Ser Glu Asn lie Pro Phe Thr Leu Pro Glu Val Ser Thr Asn Lys 

370 375 380 

Thr Tyr Ser Phe Leu lie Tyr Thr Glu Val Asp lie Gly Glu Leu Leu 
385 390 395 400 

Met Leu Lys Leu Lys Trp Lys Ser Asp Ser Tyr Phe Ser Trp Ser Asp 

405 410 415 

Trp Trp Ser Ser Pro Gly Phe Ala lie Gin Lys lie Arg Val Lys Ala 

420 425 430 

Gly Glu Thr Gin Lys Lys Val lie Phe Cys Ser Arg Glu Lys Val Ser 

435 440 445 

His Leu Gin Lys Gly Lys Ala Pro Ala Val Phe Val Lys Cys His Asp 
450 455 460 
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Lys Ser Leu Asn Lys Lys Ser Gly 
465 470 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asp Thr Ser Pro Leu Cys Phe Ser lie Leu Leu Val Leu Cys lie 

15 10 15 

Phe lie Gin Ser Ser Ala Leu Gly Gin Ser Leu Lys Pro Glu Pro Phe 

20 25 30 

Gly Arg Arg Ala Gin Ala Val Glu Thr Asn Lys Thr Leu His Glu Met 

35 40 45 

Lys Thr Arg Phe Leu Leu Phe Gly Glu Thr Asn Gin Gly Cys Gin lie 

50 55 60 

Arg lie Asn His Pro Asp Thr Leu Gin Glu Cys Gly Phe Asn Ser Ser 
65 70 75 80 

Leu Pro Leu Val Met lie He His Gly Trp Ser Val Asp Gly Val Leu 

85 90 95 

Glu Asn Trp He Trp Gin Met Val Ala Ala Leu Lys Ser Gin Pro Ala 

100 105 110 

Gin Pro Val Asn Val Gly Leu Val Asp Trp He Thr Leu Ala His Asp 

115 120 125 

His Tyr Thr He Ala Val Arg Asn Thr Arg Leu Val Gly Lys Glu Val 
130 135 140 
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Ala Ala Leu Leu Arg Trp Leu Glu Glu Ser Val Gin Leu Ser Arg Ser 
145 150 155 160 

His Val His Leu lie Gly Tyr Ser Leu Gly Ala His Val Ser Gly Phe 

165 170 175 

Ala Gly Ser Ser lie Gly Gly Thr His Lys lie Gly Arg lie Thr Gly 

180 185 190 

Leu Asp Ala Ala Gly Pro Leu Phe Glu Gly Ser Ala Pro Ser Asn Arg 

195 200 205 

Leu Ser Pro Asp Asp Ala Asn Phe Val Asp Ala lie His Thr Phe Thr 

210 215 220 

Arg Glu His Met Gly Leu Ser Val Gly lie Lys Gin Pro lie Gly His 
225 230 235 240 

Tyr Asp Phe Tyr Pro Asn Gly Gly Ser Phe Gin Pro Gly Cys His Phe 

245 250 255 

Leu Glu Leu Tyr Arg His lie Ala Gin His Gly Phe Asn Ala lie Thr 

260 265 270 

Gin Thr lie Lys Cys Ser His Glu Arg Ser Val His Leu Phe lie Asp 

275 280 285 

Ser Leu Leu His Ala Gly Thr Gin Ser Met Ala Tyr Pro Cys Gly Asp 

290 295 300 

Met Asn Ser Phe Ser Gin Gly Leu Cys Leu Ser Cys Lys Lys Gly Arg 
305 310 315 320 

Cys Asn Thr Leu Gly Tyr His Val Arg Gin Glu Pro Arg Ser Lys Ser 

325 330 335 

Lys Arg Leu Phe Leu Val Thr Arg Ala Gin Ser Pro Phe Lys Val Tyr 

340 345 350 

His Tyr Gin Leu Lys lie Gin Phe lie Asn Gin Thr Glu Thr Pro lie 

355 360 365 

Gin Thr Thr Phe Thr Met Ser Leu Leu Gly Thr Lys Glu Lys Met Gin 
370 375 380 




Lys lie Pro lie Thr 
385 

Ser Phe Leu lie Thr 
405 

Lys Phe Lys Trp Glu 
420 

Val Gin Thr He He 
435 

Val Leu Lys Thr He 
450 

Thr Phe Cys Ser Glu 
465 

Glu Lys He Phe Val 
485 

Lys He Arg 
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Leu Gly Lys Gly lie 
390 

Leu Asp Val Asp He 
410 

Asn Ser Ala Val Trp 
425 

Pro Trp Ser Thr Gly 
440 

Arg Val Lys Ala Gly 
455 

Asn Thr Asp Asp Leu 
470 

Lys Cys Glu He Lys 
490 




Ala Ser Asn Lys Thr Tyr 
395 400 
Gly Glu Leu He Met He 
415 

Ala Asn Val Trp Asp Thr 
430 

Pro Arg His Ser Gly Leu 
445 

Glu Thr Gin Gin Arg Met 
460 

Leu Leu Arg Pro Thr Gin 
475 480 
Ser Lys Thr Ser Lys Arg 
495 



(2) INFORMATION FOR SEQ ID NO:15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Leu Pro Leu Trp Thr Leu Ser Leu Leu Leu Gly Ala Val Ala Gly 

15 10 15 

Lys Glu Val Cys Tyr Glu Arg Leu Gly Cys Phe Ser Asp Asp Ser Pro 

20 25 30 

Trp Ser Gly He Thr Glu Arg Pro Leu His He Leu Pro Trp Ser Pro 
35 40 45 
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Lys Asp Val Asn Thr Arg Phe Leu Leu Tyr Thr Asn Glu Asn Pro Asn 

50 55 60 

Asn Phe Gin Glu Val Ala Ala Asp Ser Ser Ser lie Ser Gly Ser Asn 
65 70 75 80 

Phe Lys Thr Asn Arg Lys Thr Arg Phe lie lie His Gly Phe lie Asp 

85 90 95 

Lys Gly Glu Glu Asn Trp Leu Ala Asn Val Cys Lys Asn Leu Phe Lys 

100 105 110 

Val Glu Ser Val Asn Cys lie Cys Val Asp Trp Lys Gly Gly Ser Arg 

115 120 125 

Thr Gly Tyr Thr Gin Ala Ser Gin Asn lie Arg lie Val Gly Ala Glu 

130 135 140 

Val Ala Tyr Phe Val Glu Phe Leu Gin Ser Ala Phe Gly Tyr Ser Pro 
145 150 155 160 

Ser Asn Val His Val He Gly His Ser Leu Gly Ala His Ala Ala Gly 

165 170 175 

Glu Ala Gly Arg Arg Thr Asn Gly Thr He Gly Arg lie Thr Gly Leu 

180 185 190 

Asp Pro Ala Glu Pro Cys Phe Gin Gly Thr Pro Glu Leu Val Arg Leu 

195 200 205 

Asp Pro Ser Asp Ala Lys Phe Val Asp Val He His Thr Asp Gly Ala 

210 215 220 

Pro lie Val Pro Asn Leu Gly Phe Gly Met Ser Gin Val Val Gly His 
225 230 235 240 

Leu Asp Phe Phe Pro Asn Gly Gly Val Glu Met Pro Gly Cys Lys Lys 

245 250 255 

Asn He Leu Ser Gin He Val Asp He Asp Gly He Trp Glu Gly Thr 

260 265 270 

Arg Asp Phe Ala Ala Cys Asn His Leu Arg Ser Tyr Lys Tyr Tyr Thr 
275 280 285 
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Asp Ser lie Val Asn Pro Asp Gly Phe Ala Gly Phe Pro Cys Ala Ser 

290 295 300 

Tyr Asn Val Phe Thr Ala Asn Lys Cys Phe Pro Cys Pro Ser Gly Gly 
305 310 315 320 

Cys Pro Gin Met Gly His Tyr Ala Asp Arg Tyr Pro Gly Lys Thr Asn 

325 330 335 

Asp Val Gly Gin Lys Phe Tyr Leu Asp Thr Gly Asp Ala Ser Asn Phe 

340 345 350 

Ala Arg Trp Arg Tyr Lys Val Ser Val Thr Leu Ser Gly Lys Lys Val 

355 360 365 

Thr Gly His lie Leu Val Ser Leu Phe Gly Asn Lys Gly Asn Ser Lys 

370 375 380 

Gin Tyr Glu lie Phe Lys Gly Thr Leu Lys Pro Asp Ser Thr His Ser 
385 390 395 400 

Asn Glu Phe Asp Ser Asp Val Asp Val Gly Asp Leu Gin Met Val Lys 

405 410 415 

Phe lie Trp Tyr Asn Asn Val lie Asn Pro Thr Leu Pro Arg Val Gly 

420 425 430 

Ala Ser Lys lie lie Val Glu Thr Asn Val Gly Lys Gin Phe Asn Phe 

435 440 445 

Cys Ser Pro Glu Thr Val Arg Glu Glu Val Leu Leu Thr Leu Thr Pro 
450 455 460 

Cys 
465 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 



# 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gly Pro Glu Gly Arg Leu Glu Asp Lys Leu His Lys Pro Lys Ala Thr 
15 10 15 

Cys 




(2) INFORMATION FOR SEQ ID NO:17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTTTTTTTTT TGA 13 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GATCAATCGC 10 
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(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GATTGTGCTG GCCACTTCTC 20 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 



(A) DESCRIPTION: /desc = "Oligonucleotide 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TAGGACATGC ACAGTGTAAT CTG 



23 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GACACTCCAG GGACTGAAG 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
( ix ) FEATURE : 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 36 

(D) OTHER INFORMATION: /mod_base= i 
( ix ) FEATURE : 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 37 

(D) OTHER INFORMATION: /mod_base= i 
( ix ) FEATURE : 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 41 

(D) OTHER INFORMATION: /mod_base= i 
(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 42 

(D) OTHER INFORMATION: /mod_base= i 
( ix ) FEATURE : 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 46 
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(D) OTHER INFORMATION: /mod_base= i 
( ix ) FEATURE : 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 47 

(D) OTHER INFORMATION: /mod_base= i 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CUACUACUAC UAGGCCACGC GTCGACTAGT ACGGGNNGGG NNGGGNNG 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CACACACAGG CCACGCGTCG ACTAGTAC 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
ACCACCATGG AGAGCAAAGC CCTG 



♦ 4 
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(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CCAGTTTCAG CCTGACTTCT TATTC 25 



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GGCTGTGGAC TCAACGATGT C 21 



(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



CCGGGTGGGT AGGTACATTT TG 



22 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GGGGGTGACT TCCAGCCAGG CTGTG 25 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AACTCTGAAA GGCATGCCTG CCCGG 2 5 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 



- 162 - 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TGAAGGTCGG AGTCAACGGA TTTGGT 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CATGTGGGCC ATGAGGTCCA CCAC 



